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ABSTRACT, - A computerized algorithm is presented for the examination of lists of scientific names, 
assigning current names to known synonyms and misspellings, and suggesting the most probable correct 
name for new combinations and new misspellings. The algorithm was implemented in a system that 
made use of W.N. Esehmeyef s fish genera database of 1990 and of species names and synonyms 
compiled in FishBase, a biological database on fin fish containing, at the time of this study 15,000 of the 
estimated 25,000 recent species of fish. The algorithm was tested with six independent data sets with 
altogether 10,711 scientific names. About 60% of these names either directly matched valid names, 
synonyms or known misspellings, or could be assigned automatically to a valid name. For the remaining 
4,385 names, the algorithm made suggestions which led to the identification of 1,218 synonyms, 798 
misspellings, and only 72 cases that needed more research and could not be decided based on the 
information provided by the algorithm. No match was found for 2,146 names referring mainly to spe¬ 
cies not yet contained in FishBnse. 

RESUME, - Un algorithme pour identifier les synonymes et les erreurs d onhographe dans les listes de 
noms scientifiques dc Poissons. 

Cet article presente un algorithme informatisd pour la verification de la nomenclature scientifi- 
que. Cet algorithme consiste h faire corresponds un nom scientifique a des synonymes ou noms mal 
orthographies ddj& repertories dans la base de donnees et, h ddfaut, a suggdrer le nom correct le plus 
probable en procedant a de nouvelles combinaisons avec les synonymes et noms mal orthographies les 
plus proches. Cet algorithme est realise au sein d l un systdme qui exploite la base de donndes sur les 
genres de poissons de W.N. Eschmeyer d at ant de 1990 ainsi que les noms et synonymes rdpertorids 
dans FishBase, une base de donates biologique sur les poissons con tenant 15 000 des 25 000 e spaces 
rdeentes de poissons estimdes. L 1 algorithme a dtd testd sur six sdries de donndes inddpendantes compo- 
sdes au total de 10 711 noms scientifiques. Environ 60% de ces noms ont pu etre associds ft des noms 
valables, a des synonymes ou & des noms mal orthographids connus, ou ont pu etre automatiquement 
rapportds & un nom valable. Pour les 4 385 noms restants, V algorithme a suggdrd des combinaisorts qui 
om about! a Tidentification de 1 218 synonymes, 798 mots mal orthographids et seulement 72 cas pour 
Lesquels aucune decision n’dtait possible & partir des informations foumies par Talgorithme. Aucun 
recoupement n’a pu dtre rdalisd pour 2 146 noms d’espdees non encore rdpertorids dans FishBase. 

Key words. - Taxonomic databases. Misspellings, Synonyms. Nomenclature. 


One of the more tedious tasks of taxonomists is to check and update long lists of 
species names that biologists, ecologists, authors, resource managers and, more recently, 
those with interest in biodiversity have sent to them for comment. Typically these lists 
include many synonyms and misspellings. Also, museums have begun to computerize 
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their collections and are faced with the problem of updating names that might have been 
assigned more than 200 years ago. Manual verification of scientific names can be a frus¬ 
trating task, as is demonstrated by Badly and Bureau (1995) who found 14 different spell¬ 
ings for a single species name (Bregmaceros mcctetlandi) in two bibliographic databases* 
as well as 14 different citations of the original publication. 

This paper presents an algorithm which largely automates and simplifies the proc¬ 
ess of verifying scientific names, A prototype of this algorithm has been implemented in 
FishBase, a biological database on finfish developed by the International Center for 
Living Aquatic Resources Management (ICLARM) in collaboration with the Food and 
Agriculture Organization of the United Nations (FAO) and numerous individual experts and 
institutions, with support from the European Commission (EC) (Pauly and Froese, 1991; 
McCall and May, 1995; Froese and Pauly, 1995, 1996), 

The goal of the algorithm is twofold: first, it aims to assign a valid name to as 
many as possible of the submitted names; second, it aims to provide all necessary infor¬ 
mation to assist the user in selecting a valid name for cases where a valid name can not be 
assigned automatically. 


MATERIALS AND METHODS 

The algorithm is based on the rules of the International Code of Zoological No¬ 
menclature (1985: 1CZN) and was optimized from the experience gained with ihe proto¬ 
type described below. It is divided into three parts which are documented in Appendix I, 
2 and 3, respectively. The chosen format should make it easy to implement the algorithm 
in any database programming language. 

The algorithm uses the following sources of information: 

L The FishBase FAMILIES table which contains the 469 family names of fishes as 

recognized by Fschmeyer (1990), plus additional information from other sources, notably 
Nelson (1994). The algorithm used only the field containing the valid family name. 

2, The GENERA table developed by W.N, Eschmeyer and corresponding to his 
monograph Catalog of Genera of Recent Fishes (Eschmeyer, 1990), which contains the 
more than 10,000 generic and subgeneric names used for fishes until the end of 1989. 
W,N, Eschmeyer made GENERA available for use and distribution through FishBase. For 
the purpose of this study, only four fields of GENERA were used: GenName, Status (as 
assigned by Eschmeyer, 1990), CurrentGenus, and Family; 

3. The FishBase SYNONYMS table, which contains about 15,000 valid names of 
the estimated 25,000 species of recent finfish plus about 24,000 synonyms classified in 
one of the following categories; junior synonym, original combination, new combina¬ 
tion, misspelling, misidentification, questionable, other: only synonyms of the first 4 
categories were considered. The algorithm used the following fields of the SYNONYMS 
table: SynGenus, SynSpecies (includes subspecies), SynAuthor, SynStatus, Valid 
(yes/no), SynRef (reference used to derive the status of the synonym), CurrentGenus, 
CurrentSpecies (includes subspecies). Current Author, and Family. 

In preparation for running the algorithm, a list of scientific names (binomia and 
trinomia) is imported into a table that allows additional information to be assigned to 
each name. The following 5 fields are provided for import: ImportGenus, Importspecies 
(includes subspecies). Import Author (with year and parentheses, if required), ImportFa- 
mily, and ImportCommonName, The examination of the names follows four major steps: 
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Masculine 

Feminine 

Neuter 

-U6 

-e 

-urn 

-is, no suffix 

-ia 

< 

j er 

-eta, -ri 

-erutn, -rum 

-tm 

-na 

-ns 

-X 

-X 

-X 


-fcs 

-as 

ior 

-ior 

HU* 


surnx 

Replacements 

Li 

i. no character | 

i 

ii, no character 

us 

ns, as, is 

ns 

us. as 

as 

us, ns 


Table II. - Frequent suffix misspellings 

Table f - Equivalent suffixes used for finding identical considered for finding matching 

(sub)specific epithets. (sub)specific epithets. 

L If a submitted name matches directly a valid, synonymous, or misspelled name i n 
FishBase, the algorithm adds the valid name, author and year, family, and the ref¬ 
erence on which the status of the valid name or synonym was based and proceeds 
with the next name (Steps I and 25 in Appendix 1). 

2. If no direct match is found, the algorithm tries to find unambiguous corresponding 
names by testing a combination with the valid genus * when the submitted genus 
was a synonym* and several modifications of the suffix of the (sub)specific epi¬ 
thet. The suffixes listed in table I are replaced with the corresponding suffixes in 
other genders; the suffixes listed in table II are replaced accordingly to account for 
common misspellings (Steps 6-24 in Appendix 1). If a match is found, the algo¬ 
rithm adds to the imported record the valid name, author and year, family, and the 
reference on which the status of the synonym is based. 

3. If Steps 1 and 2 have found no matching name, the algorithm checks the validity 
of the generic name. If a matching valid, synonymous or misspelled generic name 
is found, the algorithm adds the valid genus and family to the submitted name 
(Steps 2*5 in Appendix 1). If no matching generic name can be found, the algo¬ 
rithm tries to find similar names. It first tests for misspellings in the last one or 
two characters of the submitted generic name (Steps 1-5 in Appendix 2). It then 
tests for misspellings in the second and third character by replacing them with 
combinations of single wildcards, i.e,* a special character (?) that matches any 
single character (but not zero characters; Steps 6-9 in Appendix 2). Finally it re¬ 
places the fourth and following characters with a general wildcard, i.e., a special 
character (*) that matches zero or more characters. Generic names of fishes range 
in length from 2 to 23 characters (Eschmeyer, 1990). To limit the number of 
matching names, only those with similar length as the submitted name are consid¬ 
ered (Steps 10-14 in Appendix 2), The resulting comments and possible generic 
names are printed. Based on these suggestions, obviously misspelled generic 
names are corrected manually and the algorithm is rerun starting with Step I 
above. 

4. If Steps 1-3 have not found a matching name, the algorithm makes suggestions 
for those names for which at least a valid generic name had been found. Compari¬ 
sons are limited to (sub)specific epithets within the same family. It first tests for 
misspellings in the second and third character of the (sub)speciftc name (Steps 1* 
5 in Appendix 3). It then replaces the fourth and following characters with a gen¬ 
eral wildcard, (Sub)specific epithets contained in FishBase range in length from 2 
to 20 characters; only matching names of similar length to the submitted name are 
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accepted (Steps 6-10 in Appendix 3). Comments and possible names are printed. 
Based on these suggestions, further new combinations, synonyms and misspell¬ 
ings are identified and manually corrected. 

The algorithm was tested with the following data sets: a checklist from a recent 
book on Indo-Pacific fishes (Book): a curatorial database of a small museum collection of 
Indo-Pacific fishes (Museum 1): a fish collection database of a small museum in North 
America (Museum 2); a reference list used by a fisheries department in the Pacific 
(Fisheries); a species list from a trawl survey conducted in 1980 in the Indian Ocean 
(Trawl I); and a recent reference list used for trawl surveys in the Atlantic, Pacific and 
Indian Oceans (Trawl 2). These documents are not further identified here to protect their 
compilers from undue criticism and to emphasize that the errors they contained are typical 
of the class of documents they represent. 

The scientific names from these data sets were processed and the following statis¬ 
tics were recorded: 

=* Species: total number of submitted unique scientific names; 

=> Automatic match: number of submitted names that matched with a valid, synony¬ 
mous, or misspelled name or that could be automatically assigned to a valid name, 
as described in Step 2 above; 

=* Synonyms (Suggestions): number of synonymous names that couid be manually 
assigned to a valid name, using no other information than the provided printout; 

=> Misspelling (Suggestions): number of misspelled names that could be manually 
assigned to a valid name, using no other information than the provided printout; 

=> More research (Suggestions): number of names that required more information 
than provided in the printout; 

=> No match: number of names for which no match could be found. 


KESULTS 

The results of applying the algorithm to six data sets with altogether 10,711 sci¬ 
entific names are summarized in table III. About 60% of these names either directly 
matched valid names, synonyms or known misspellings, or could be assigned automati¬ 
cally to a valid name. See tables IV and V for examples of the resulting printout. For the 
remaining 4,385 names, the algorithm resulted in suggestions which led to the identifica¬ 
tion of 1,218 synonyms, 798 misspellings, and 72 cases that could not be decided based 
on the information provided by the algorithm. No match was found for 2,146 names, 
mainly because these species were nor yet contained in Fish Base. See table VI for an 
example of the suggestions that were the basis for manually assigning valid names to new 
misspellings and combinations. 


DISCUSSION 

A routine roughly similar to that presented here, he., capable of comparing scien¬ 
tific names against a reference list, exists for legumes (F. Bisby, pers. comm.). However, 
to date that approach has not been published and the author was not able to compare it 
with the approach suggested here. It is planned to provide an algorithm similar to the one 
presented here for use with the checklist of all known organisms that is currently devel¬ 
oped by the SPECIES 2000 initiative (Bisby and Smith, 1996). 
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Table UI. - Results of evaluating six lists of scientific names. See text for more details. 


fhitii sources 

Species 

Autum. 

match 

Synonym* 

(Suggestions) 

Misspelling 

(Suggf'ilion*) 

More research 

( Suggestions) 

Nu match 

Book 

2066 

L321 

44 

152 

9 

481 

Museum t 

1032 

691 

44 

85 

16 

168 

Museum 2 

3049 

1433 

361 

287 

L2 

956 

Fishery 

1378 

1026 

25 

98 

4 

218 

Trawl I 

970 

159 

670 

42 

L5 

63 

Trawl 2 

2216 

1696 

74 

134 

16 

260 

Total 

10711 

6326 

1218 

798 

72 

2146 


Table IV. - Example of a printout resulting from algorithm described in Appendix 1. Scientific names 
submitted for checking are in bold. Current status and author of the name according to the 
SYNONYMS table in FishBase are given in square brackets. The second line gives valid name, author, 
and reference on which the status of the synonym is based. 


Abantennarrus iinuiis [original combination, (jagclinc 1957| 

= Afttnwatius anatis (Goseline 1957), Ref, Pietsch, T.W and D,B, (irobecker. 19&7, p. 163 

Abtinicnnarius ncocaledoniensis (junior synonym, Lc Danoiii 19641 
- ArttrrwariuA rorriitrus (Lesson 1830), Ref Fictsi'h, TW. and D,B Grnbecltcr. 1987 

Abroad aurora [new combmaiirm, (Iordan & Gitben I 

~ MicTQfnrTruSCuront (Jordan & Gilbert 1880) Ref Hschmeyer, W.N., F.Y Hera Id and H H^mrnann. 19X3, p 231 

Abtabys bin of at a (misspelling, (Peters 1X55)] 

= Ahfahys binotatus (Pelers 1855), Ref. Pnns, S . 198.6, p. 479 

Abrtsmh pekineitxh |original combination, Basilcwskv 1855] 

= Parabramispekinrwts (RasiTewsky 1855), Ref. Berg, L&, 1964. p.358 

A bra m ite j .1 ohirii (j an ior sty nony m, ' H j Jlmbcrg J 891) | 

= Abnmutes hypselcnatus (Giinlher 1868), Ref Van, R P. and A M W iihams, 1987, p. 89 


Table V, - Example of a printout resulting from the algorithm described in Appendix 2. Generic names 
from ihe submitted list that do match synonyms in Eschmeyer’s (1990) GENERA database are shown in 
bold. The author of the name is given within the square brackets. 


Abanfrnnarius jSchtiitr 1957. synonym] 

- Aufitmariiii, Daudla 1816 

Aheona [Girard 1855, synonym] 

= Afkromefrus, Gibbons 1854 

A brama c vphaiu \ \ JJtcindacb ner J 869, ?,y nuny ro| 

- Nypuphihaimirhrhys, Bleeker 1859 

A branches [Smith 1947 synonym] 

= Gobiopsis, Sleindachnc^ 1X60 

Ahraitomus [Smith ]84L synonym] 

- Lobro t Cuvier 1816 

Abryais [Jordan &. Snyder 1902, synonym] 

= PhoUdopw,' Bean &. Bean 1896 


The presented algorithm is a preliminary approach to identifying synonyms and 
misspellings in lists of scientific names. It is straightforward for valid names and re- 
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corded synonyms and misspellings and its effectiveness for these categories solely de¬ 
pends on the completeness of the underlying GENERA and SYNONYMS table. The ap¬ 
proach to assigning valid names automatically to unrecorded misspellings and combina¬ 
tions was a cautious one. According to ICZN, the suffix of a (sub)specific epithet that is or 
ends in a Latin adjective must agree in gender with the generic name (Article 31(b), Inter¬ 
national Code of Zoological Nomenclature, 1985}. Because of this variability and because 
the gender of the generic name and the typ'j of the (sub Specific epithet (adjective or noun 
in apposition) are not always dear, allowed and erroneous variations in the suffix of a 
(sub)specifk epithet are more frequent than in the stem. Since it is the stem and not the 

Table VL - Example of a printout resulting from the algorithm described in Appendix 3. Submitted 
scientific names and authors are given in bold. Indented lines show similarly spelled specific epithets 
from within the same family, with their status and the reference on which the status is based. An ‘Esch/ 
indicates that the spelling of the scientific name and author have been checked against a 1996 version 
of Eschmeyer's PISCES database. The user has to assign manually a valid species to the submitted 
name. Note that the author name is essential for correct assignments. Phoxinellus handlirschi is a new 
combination of Acamhonailus handlirschi Pietschmann 1933, Fundus katalo is a misspelling of P. katalo 
(Herre, 1924), and Solegnarhus dunckeri (Whiteley, 1927) is a valid name not yet contained in 
FishBase. 


Phaxinettus handhrpchi iPiet^ihm.mn 1933) 

handlirsebi, Acanthomtihis Picuchmaiin 1933, Valid [Robins. C.R., R.M, bid Icy . C\E, Bond, LR, Brooker 
E.A. Lachner, R.N. Lea and W.B. Scott, 1991, p. 36| Esch. 

/HwAiitfflBi, ttybopinthus Hubbs 1927, Valid [Pagt:, L.M. and EM. Burr, 1991, p, 1Q9, 374) 

Puutius kutvfu (Herrc 1924) 

kaUmgaCy Baibas Boulcngor 1900. junior synonym of Barbusmtmiculatus Peters 1852 [Levcque, C, and J. 
Daget* 1984. p. 299-3001 

katangae, Chelarthiops Poll 1948 junior synonym of Chetaethiaps congicus (Nichols & Grisnom 1917) 
p.cvcque. C. and I, Dngcl. 1984, p. 299-300) 

kaiinrnsis. XenOcypris Tdorng 1930, questionable of XciUKppris mttcrolepis BIclTcj 1871 | Berg, US,. 1964, 
p. 173) Each. 

katalo. Barbales Hcrre 1924. orieinul ; ambuiation of Pun:ius karoh iHerre 1924) [Herre, A,W„ 1953, p. 125} 
Esdt 

katalo. Punlius 1924). Valid [Hcire, A.W., 1953, p. 125} Each. 

&nlet(titiifius dunckeri (Whitel<?\ 1927) 

dunckeri* Sytigiwthiv Met/claitr i9l9, one inn I combination ol Bryxdunckrri (Metzdaar 1919) [ Rob ins. C,R,, 
R.M, Bailey. C\E. Bond. J,R Blanker. E,A,I ai,hncr R.N, Lea and W.B, Scott. 1991. p. 37, 84) 
dunckeri, Bry.x (Mctzdaar 1919), Valid [Robin*. t\R,, R.M Bailey, C,h. Bond, J.R. Brouter. E.A La^hner. 

R.N. Lea and W.B. Scott. 1991. p, 37. 84} 
dunckeri T HalicampUA (fhahonaud 1929), Vahd [Myers. R.l\. I 4 * 1 * L p, 88, 89) 

dunefceri* Miaoph is (Pra-had & Mukeiji 1929). Valid [Talwar. PK and A,G. Hi in gran 1992. p, 772] Fseh. 
dunckeri, Doryichthys Pra^had & Mukerji 1929, original combination ol Micropkis tlwickeri (fYashad & 
Mukerji 1929) flalwar, P.K. and A G. Jhingran, 1997. p, 7 221 Esch. 
dunckeri, hticrognathus C babariaud 1979, original combination ol Halicampus dtmekeri (Chabimaud 1929} 
_[Dawson, C. 1986. p. 451]_ 





Algorithm for checking lists of scientific names of Fishes 


271 


suffix that differentiates (sub)specific epithets* certain variations in the suffix can be 
safely ignored in the matching process. 

It was assumed safe to assign valid names automatically if: i) the submitted genus 
was identical with or a known synonym of the valid genus: ii) the stem of the submitted 
(sub)specific epithet matched that of the valid name; and iii) the suffix of the submitted 
(sub)specific epithet matched directly, or was of another gender (Table I), or one of a few, 
common misspellings (Table II). Note that the sequence in step 8* Appendix 1, ensures 
that, e.g., the suffix "urn" is not found before "emm", If a safe method could be devised to 
distinguish automatically between the stem and the suffix - including irregular adjectives 
such as pains ter, palustris, palustre and major, majus or nouns without suffix such as 
cuvter - the suffix might be ignored altogether and the matching of (sub)specific epithets 
might be based on the stem only. 

The automatic matching routine is based only on the submitted binomina or tri- 
nomina, and ignores additional provided information on, e.g*, families or authors* Such 
information can theoretically be used, for example, to search for misspelled genera only 
within a given family, or use the authority to distinguish between identical (sub)speeific 
epithets within a family, However, the assignment of genera to families can follow differ¬ 
ent taxonomies, and the spelling of names of authorities differs widely. Such additional 
information is utilized to assist the user in matching the name submitted with similarly 
spelled names in the provided printout (see table VI), 

For generic names, automatic assignment by replacing the last one or two charac¬ 
ters with, e.g., single wildcards was not an option, because differences in these characters 
can actually describe distinct genera, such as in Acanthogobio Herzenstein 1892, in 
Cyprinidae and Acamhogobius Gill 1859, in Gobiidae (Eschmeyer, 1990)* The algorithm 
as presented here does not accommodate checking of subgeneric names (tetranomia). 
However, Eschmeyer*s GENERA contains also subgeneric names and treats them as syno¬ 
nyms of the respective genus. Thus, if ihe submitted generic name was considered subgen¬ 
eric by Eschmeyer (1990), the scientific name would be accepted and the status of the 
generic name would be pointed out in the printout resulting from the procedure described 
in Appendix 2. 

It might be asked why the algorithm does not start with the examination of the 
generic name, as would seem logical. The used sequence ensures that information con¬ 
tained in the synonymy is considered before the generic name is used to assign a family to 
the submitted name (Step 3 above; Steps 2-5 in Appendix 1), For example, Chromis 
niloticus is a known synonym of Oreochromis niloticus in Cichlidae. However, Chromis 
is a valid genus in the family Pomacentridae. Thus, the algorithm makes use of the fact 
that the binomen contains more information than either the generic name or the specific 
epithet alone. 

The approach used for finding new misspellings could be improved to avoid offer¬ 
ing names that are clearly non-matching (see table VI)* For example, one could subse¬ 
quently replace each character with a single wildcard, two single wildcards, and no charac¬ 
ter to account for mistyped, additional, or missing characters, respectively* However, 
every such combination has to be compared with sometimes more than 1,000 names, 
which increases the search time considerably while still not finding cases where more 
than one character are mistyped or where subsequent characters are inverted, both of which 
happen quite often. The algorithm found most of such cases without suggesting too many 
obviously different names. For example, of the 14 different spellings of Bregmaceros 
mcclellandi listed in Bailly and Hureau (1995), two ( B mcclellandi (valid) and B, macclel- 


272 


Froese 


iandii (misspelling)) were contained in the FishBase SYNONYMS table and thus matched 
directly, nine could be automatically assigned to the valid name, two were suggested as 
being similar to the valid name, and one spelling ( B . Mac Clellandii) which is not permit¬ 
ted under ICZN, was not found. 

A search using the sound-alike wildcard (~) of two database systems (DataBase* 
1990 and Microsoft Access, 1994) did not produce useful results. This, however, could 
probably be improved if the underlying routine could be trained on scientific names. 

Some colleagues have compared the algorithm proposed here to a spell-checker, 
but this comparison ts misleading. A spell-checker makes use of pattern-matching rou¬ 
tines to find similar combinations of characters without any consideration of what the 
words mean. The algorithm presented here makes use of the information contained in the 
taxonomic hierarchy to restrict the search for a (sub)specific epithet to the corresponding 
genus or family; it also makes use of the synonymies contained in the latest revisionary 
work, notably Eschmeyef s recent revision of the generic names. It also makes use of the 
regulations of the International Code of Zoological Nomenclature (1985) by distinguish* 
mg between the stem and the suffix of a (sub)specific epithet. 

The six data sets tested stemmed from a variety of sources (see above) and are 
probably representative of similar compilations. All data sets were recent with the excep¬ 
tion of Trawl 1 which was assembled in 1980 and which had the lowest percentage (16%) 
of directly matching names. Overall, the algorithm was able to assign valid names auto¬ 
matically to about 60% of the names submitted. If we assume that the 2J46 names in the 
"No match" category are not yet in FishBase and exclude these names from the compari¬ 
son, the percentage of direct hits was 74%. In other words, with a database containing all 
species of a group and with a synonymy similar to that of FishBase, the algorithm can be 
expected to find automatically, on the average, corresponding valid names for three out of 
four submitted names. Conversely, similar lists of scientific names can be expected to 
contain 26% new combinations or misspellings. 

For the 4,386 names for which no automatic match could be found, the algorithm 
made 2,016 (46%) acceptable suggestions and identified 72 cases that could be solved 
with additional information. If we again exclude the "No match” names from this consid¬ 
eration, 90% of the suggestion made by the algorithm led to the correct assignment of a 
submitted new combination or new misspelling. 
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page of the California Academy of Sciences at: 

http i //www, emlacftdny, or g/research/Ichthyology /*p*ei**/ 


REFERENCES 

BAtLLY N, 8l L-C. HUREAU, 1995.- Bases de donndes en biologic; quelques probldmes lids & La 
nomenclature et aux rdfdrences bibliographiques. Cybium* 19(4): 323-324. 


Algorithm for checking lists of scientific names of Fishes 


273 


BERG L,S,, 1964, - Freshwater fishes of the U.S.S.R. and adjacent countries, Vol. 2, 4th Edition, Israel 
Program for Scientific Translations Ltd., Jerusalem. 

BISBY F. & P. SMITH, 1996. - Species 2000 Project Plan, 44 p, Umv, Southampton, Southampton, UK, 
DATABASE INTERNATIONAL, 1990. - DataBase reference manual, Trumbull, USA, 

DAWSON G, 1986. - Syngnathidae, pp. 445458. In: Smith's Sea Fishes (Smith M,M, and P.C 
Heemstra, eds). Springer-Verlag, Berlin. 

ESCHMEYER W.N„ 1990. - Catalog of the Genera of Recent Fishes. 697 p. California Academy of 
Sciences, San Francisco, 

ESCHMEYER W.N., HERALD E.S. & H. HAMMANN, 1983, - A held guide to Pacific coast fishes of 
North America. 336 p, Houghton Mifflin Company, Boston. USA, 

FROESER, & D, PAULY, eds, 1995, - FishBase: a biological Database on Fish (Version 1.2). 146 p, 
ICLARM, Manila, Philippines. 

FROESE R, & D. PAULY, eds., 1996. - FishBase 96: concepts, design and data sources. 179 p. 
ICLARM, Manila, Philippines, 

HERRE A.W., 1953, - Check list of Philippine fishes. Res. Rep, U S. Fish, Wildl. $erv„ (20): 977 p. 
INTERNATIONAL CODE OF ZOOLOGICAL NOMENCLATURE, 1985. - International Trust for 
Zoological Nomenclature. 338 p. 

L£v£QUEC. & J, DAGET. 1984.- Cyprinidae, pp, 217-342. In: Check-List of the freshwater Fishes 
of Africa (CLOFFA), Vol, 1 (Daget L, Gosse J,-P, & DF.E. Thys van den Audenaerde, eds), 
OR S TOM, Paris and MR AC. Tervuren. 

McCALL R.A. & R.M* MAY, 1995. - More than a seafood platter. Nature t 378: 735. 

MICROSOFT, 1994, ‘Microsoft Access language reference. Microsoft Corporation, USA. 

MYERS R.F., 1991. - Micronesian reef fishes. Second Edition. 298 p. Coral Graphics, Bamgada, 
Guam. 

NELSON J.S., 1994, - Fishes of the World. Third Edition. 600 p. John Wiley & Sons, New York. 

PAULY D, & R, FROESE, 1991, - FishBase: Assembling Information on Fish, Naga> ICLARM Q. t 14(4): 
10-1L 

PIETSCH T,W. & D,B. GRQBECKER, 1987. - Frogfishes of the World. Systematica, Zoogeography, 
and behavioral Ecology, 420 p. Stanford Univ. Press, Stanford, California. 

POSS S. ( 1986. - Tetrarogidae, pp, 479. In: Smith's Sea Fishes (Smith M.M, & P.C, Heemstra, eds). 
Springer-Verlag, Berlin, 

ROBINS CR„ BAILEY R.M, ( BOND C,E„ BROOKER J,R„ LACHNER E.A„ LEA R.N. & W.B. 
SCOTT, 199 L - Common and scientific Names of Fishes from the United States and Canada. 
Amer, Fish, Soc. Spec, PubL, (20): 183 p, 

ROBINS C.R., BAILEY R.M., BOND C.E., BROOKER J.R., LACHNER E.A.. LEA R.N. & W.B. 
SCOTT, 1991. - World Fishes important to North Americans. Exclusive of Species from the 
continental Waters of the United States and Canada, Amer. Fish. Soc, Spec, PubL, (21): 243 p. 
TALWAR P,K. & A G. JHINGRAN. 1992, - Inland fishes of India and adjacent countries. Vol. 2. A.A. 
Balkema, Rotterdam, 

VARIR.P. & A,M. WILLIAMS, 1987. - Headstanders of the neotropical anastomid genus Abramites 
(Pisces: Characiformes: Anostomidae). Proc. Biol Soc. Wash., 100(1): 89-103. 


Regu le 20.09.1996 . 

Accepts pour publication le 30,01.1997. 


274 


FROESE 


APPENDIX I 

Algorithm to automatically assign a valid name to a submitted list of zoological names. See ta¬ 
ble IV for an example of a resulting printout. Note that we refrained from summarizing steps to facili¬ 
tate translation into any database query language. 

la Submitted name matches a valid name, a synonym, or a 

recorded misspelling...-.-— 25 

lb Submitted name does not match.-....2 

2 a (lb). Generic name val id «— -——*— -—-—-*-——* 5 

2b (lb). Generic name synonym, recorded misspelling, or unknown-*--—-— 3 

3a (2b). Generic name synonym or recorded misspelling-*---4 

3b (2b). Generic name unknown .—. 26 

4a (3a). Combination of valid Genus and submitted (subspecific name 

matches a valid name, a synonym, or a recorded misspelling —.25 

4b (3 a), Combi nati on of val id Genus and s ubmitted 

(sub)specific name does not match ----—----- 5 

5 (2a, 4b), Action: Add valid Genus and Family to the submitted name-6 

6a (5), Submitted name is a trinomen ----—---*—7 

6b (5). S ubmitted name is a bi nomen..—-.—.—.-—.-8 

7a (6a), Combination of submitted or valid Genus and 

subspecific epithet matches a valid name, a synonym, 

or a recorded misspelling ™—*——-—■-----25 

7b (6a). Combination of submitted or valid Genus and 

subspecific epithet does not match —..—~.-.-— 8 

8a (6b, 7b). Submitted specific or subspecific name ends on “erum". 9 

8b (6b, 7b), Submitted specific or subspecific name ends on ‘era”.-. 10 

8c (6b, 7b). Submitted specific or subspecific name ends on "er”.. It 

8 d (6b, 7b). Submitted specific or subspecific name ends on “ior" .-. 12 

8e (6b, 7b). Submitted specific or subspecific name ends on “ius” . 13 

8f (6b, 7b), Submitted specific or subspecific name ends on “is"...-. 14 

8 g (6b, 7b), Submitted specific pr subspecific name ends on “us”. 15 

8h (6b, 7b>, Submitted specific or subspecific name ends on “um”... 16 

8i (6b, 7b). Submitted specific or subspecific name ends on “ii" —.-.- 17 

8j (6b, 7b). Submitted specific or subspecific name ends on “i” .. 18 

8k (6b, 7b). Submitted specific or subspecific name ends on “a” -- 19 

81 (6b, 7b)* Submitted specific or subspecific name ends on “e”.-. 20 

8m (6b, 7b). Submitted specific or subspecific name ends on “ns" ——————. 21 

8 n (6b, 7b), Submitted specific or subspecific name ends on “as".-...— 22 

So (6b, 7b), Submitted specific or subspecific name ends on “ra" ————————- 23 
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8p 

(6b, 7b). 

Submitted specific or subspecific name ends on “rum”.. 

Sq 

(6b, 7b). 

Submitted specific or subspecific name ends on none 

of the above suffixes .-.-.-— 

9a 

(8a). 

Combination of submilted or valid Genus and modified (sub)specific 

name with the last four characters replaced by “era” or ”er” 

matches a valid name, a synonym, or a recorded misspelling. 

9t> 

(8a). 

Combination of submitted or valid Genus and modified (sub)speciftc 
name does not match .—-—-...-.. 

10a 

(8b). 

Combination of submitted or valid Genus and modified (sub)speeific 
name with the last three characters replaced by “crum” or “er” 
matches a valid name, a synonym, or a recorded misspelling-*--— 

10b 

(£b>. 

Combination of submitted or valid Genus and modified (sub)specific 
name does not match .-.-. 

11a 

(8c). 

Combination of submitted or valid Genus and modified (sub)spedfic 
name with the last two characters replaced by "era” or "erum” 
matches a valid name, a synonym, or a recorded misspelling ..-.— 

lib 

(8c). 

Combination of submitted or valid Genus and modified (subSpecific 
name does not match .-—..-..— 

12a 

<8d). 

Combination of submitted or valid Genus and modified (subSpecific 

name with the last three characters replaced by “ius” 

matches a valid name, a synonym, or a recorded misspelling.- 

12b 

(8d). 

Combination of submitted or valid Genus and modified (sub specific 
name does not match —-—-.-.*—.*.— 

15a 

<8e). 

Combination of submitted or valid Genus and modified (sub)speciftc 

name with the last three characters replaced by "ior” 

matches a valid name, a synonym, or a recorded misspelling.. 

13b 

(8e). 

Combination of submitted or valid Genus and modified (sub)specific 
name does not match ——-.-.—.... 

14a 

(80. 

Combination of submitted or valid Genus and modified (sub)specifie 

name with the last two characters replaced by “e” 

matches a valid name, a synonym, or a recorded misspelling .. 

14b 

(8f), 

Combination of submitted or valid Genus and modified (subSpecific 
name does not match -.—— ..——- 

15a 

(Be). 

Combination of submitted or valid Genus and modified (sub)spectfic 
name with the last two characters replaced by “a” or “um” or “ns” or “as” 
matches a valid name, a synonym, or a recorded misspelling —-——- 

15b 

(8g). 

Combination of submitted or valid Genus and modified (sub>specific 
name does not match *—*——-.—. -. 

16 a 

(8b), 

Combination of submitted or valid Genus and modified (sub)specific 

name with the last two characters replaced by "a” or "us” 

matches a valid name, a synonym, or a recorded misspelling .. 

16b 

(8h). 

Combination of submitted or valid Genus and modified (sub)specific 
name does not match . -. . .— — . . 


24 

26 

25 

26 

25 

26 

25 

26 

25 

26 


25 

26 


25 

26 


25 

26 


25 

26 
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17a 

(8i). 

Combination of submitted or valid Genus and modified (sub)specific 
name with the last two characters replaced by "i" or removed (deleted) 
matches a valid name, a synonym, or a recorded misspelling 

- 25 

17b 

(Si). 

Combination of submitted or valid Genus and modified (sub)specifk 
name does not match -*— -—-*-—-— 

- 26 

18a 

(Sj). 

Combination of submitted or valid Genus and modified (sub)speciftc 

name with the last replaced by "it" or removed (deleted) 

matches a valid name, a synonym, or a recorded misspelling ——. 

- 25 

ISb 

(8j). 

Combination of submitted or valid Genus and modified (sub)specific 
name does not match ---—------—-- 

- 26 

19a 

(8k). 

Combination of submitted or valid Genus and modified (sub)specific 

name with the last character replaced by "us” or "um" 

matches a valid name, a synonym, or a recorded misspelling.— 

- 25 

19b 

(8k). 

Combination of submitted or valid Genus and modified (sub)specific 
name does not match -.——.—-.——---— 

~ 26 

20a 

(81). 

Combination of submitted or valid Genus and modified (subSpecific 
name with the last character replaced by "is" 

matches a valid name, a synonym, or a recorded misspelling -—-. 

- 25 

20b 

(81). 

Combination of submined or valid Genus and modified 

(sub)speciftc name does not match —-—.....- 

- 26 

21a 

(8mJ. 

Combination of submitted or valid Genus and modified (sub)specific 

name with the last two characters replaced by "us" or "as" 

matches a valid name, a synonym, or a recorded misspelling-—.— 

- 25 

21b 

(Sm). 

Combination of submitted or valid Genus and modified* sub)specific 
name docs not match .-.-.-.. 

- 26 

22a 

(8n), 

Combination of submitted or valid Genus and modified (sub)specific 

name with the Iasi two characters replaced by "ns" or "us" 

matches a valid name, a synonym, or a recorded misspelling .—. - 

- 25 

22b 

(So). 

Combination of submitted or valid Genus and modified (sub)spccific 
name does not match . - . - . -——. 

~ 26 

23a 

(So). 

Combination of submitted or valid Genus and modified (sub)specific 

name with the last two characters replaced by "er” or "rum" 

matches a valid name, a synonym, or a recorded misspelling . —- 

- 25 

m 

(So). 

Combination of submitted or valid Genus and modified (sob)specific 
name docs not match .-.—.—. 

- 26 

24a 

(8p). 

Combination of submitted or valid Genus and modified (sub Specific 

name with the last three characters replaced by "er" or "ra 11 

matches a valid name, a synonym, or a recorded misspelling . 

- 25 

24b 

(8p). 

Combination of submitted or valid Genus and modified (sub)specific 

nsim** nrtf matfli _____ 

-- 26 

25, 


JMiitL- UUv£] IIUI lUtUUl —-------- 

Action: Add valid name, author and year, family, and reference 

th thf ■siihtnift______________ 


26. 


iv lilV JlH/llliliIvU 11411 IV 

Action; Proceed with next name in list If end of list, end procedure. 
Continue with procedure described in Appendix 2. 




















Algorithm for checking lists of scientific names of Fishes 


277 


APPENDIX 2 

Algorithm for suggesting similarly spelled generic names for cases where no matching generic 
name had been found. Comments and possible genera are printed. The user can correct or replace the 
submitted generic names and renin the procedure described in Appendix 1. 

la Submitted generic name with last character replaced by 

one single wildcard matches one or more valid, synonymous, 

or misspelled generic name(s)....— 14 

lb Modified generic name does not match.2 

2a (lb). Submitted generic name with last character replaced by 

two single wildcards matches one or more valid, synonymous, 

or misspelled generic name(s) --—-—-—-----* 14 

2b (lb). Modified generic name does not match -—.—...*.3 

3a (2b). Length of submitted generic name more than two characters .— 4 

3b (2b), Length of submitted generic name not more than two characters . — 13 

4a (3a). Submitted generic name with last two characters replaced by 

two single wildcards matches one or more valid, synonymous, 
or misspelled generic name(s) —..-.-. 14 

4b (3a). Modified generic name does not match.- -.5 

5a (4b), Submitted generic name with Last two characters replaced by 

one single wildcard matches one or more valid, synonymous, 
or misspelled generic name(s) -—-.-... 14 

Sb (4b). Modified generic name does not match.-.——.6 

6a (5b). Length of submitted generic name more than three characters.*—- 7 

6b (5b). Length of submitted generic name not more than three characters ——— 13 

7a (6a), Submitted generic name with second and third character each 

replaced by a single wildcard matches one or more valid, 
synonymous, or misspelled generic name(s)...*.— 14 

7b (6a). Modified generic name does not match ————.-.—.8 

8a (7b). Submitted generic name with second and third character 

together replaced by a single wildcard matches one or 
more valid, synonymous, or misspelled generic name(s) —.— 14 

8b (7b). Modified generic name does not match —-.-—-----.- 9 

9a (8b), Submitted generic name with second and third character 

together replaced by three single wildcards matches one or 

more valid, synonymous, or misspelled generic name(s) —-. 14 

9b (8b), Modified generic name does not match...*— --- 10 

10a (9b), Length of submitted generic name more than ten characters —.. 11 

10b (9b). Length of submitted generic name more than five characters—..*12 

10c (9b). Length of submitted generic name less than six characters——.13 
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11a (10a), Submitted generic name with fourth and following diameters replaced by 

a general wildcard and with the matching name of a length equal to 
the length of the submitted name +A two characters matches one 


or more valid, synonymous, or misspelled generic name(s). 14 

lib (10a), Modified generic name does not match.—.-— 13 


12a (10b), Submitted generic name with fourth and following characters replaced by 

a general wildcard and with the matching name of a length equal to 
the length of the submitted name +/- one character matches one 


or more valid, synonymous, or misspelled generic name(s) -—-. 14 

12b (10b). Modified generic name does not match.-—-. 13 

13 Action: Print submitted generic name with comment: 


ll No matching or similarly spelled generic name could be found ” 
Proceed with next name. If end of list, end procedure. 

Continue with procedure described in Appendix 3. 


14 Action: Print submitted generic name and all matching or 

similarly spelled generic names, the status of the names, 
the corresponding valid generic name, and the family. 
Proceed with next name. If end of list, end procedure. 

Continue with procedure described in Appendix 3, 
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APPENDIX 3 

Algorithm for suggesting similarly spelled specific or subspecific epithets from within the same 
family; the routine only considers submitted names that have a family but still no valid name assigned to 
them: comments and possible scientific names are printed. Based on the printout, the user can correct or 
replace submitted names and renin the procedure described in Appendix 1. 

la Length of submitted (suh)specific epithet equals two characters —————— 2 

lb Length of submitted (sub)specific epithet more than two characters.—— 3 

2a {la). Submitted (sub)spedfic epithet with second character replaced by 

a single wildcard matches one or more valid, synonymous, or 
misspelled (sub)spectfic epithet(s) within the same family. 10 

2 b (la). Modi Tied sub(sped fi c) ep it he r does not mate h.9 

3a (lb). Submined (sub)specific epithei with second and third character each 

replaced by a single wildcard matches one or more valid, synonymous, 
or misspelled (sub)spedfic epithet(s) within the same family —-.—. 10 

3b (lb). Modified sub{specific) epithet does not match -.-..——4 

4a (3b). Submitted (subspecific epithet with second and third character together 

replaced by a single wildcard matches one or more valid, synonymous, 

or misspelled (sub)specific epithet(s) within the same family.——— IQ 

4b (3b). Modified sub(specific) epithet does not match ———..——5 

5 a (4b). Submitted lsub specific epithet with second and third character together 

replaced by three single wildcards matches one or more valid, synonymous, 
or misspelled (sub)specific epithet(s) within the same family —-.- 10 

5b (4b), Modified sub(specific) epithet does not match .-—-. 6 

6 a (5b). Length of submitted (sub)specific epithet more than 10 characters -—— 7 

6 b (5b). Length of submitted {sub)specific epithet more than 3 characters. 8 

6 c (5b). Length of submitted (sub)specific epithet less than 4 characters —-—-- 9 

7a ( 6 a). Submitted (sub)specific epithet with fourth and following characters 

replaced by a general wildcard and with the matching name of a length 
equal to the length of the submitted name +/- two characters 
matches one or more valid, synonymous, or misspelled 

(subSpecific epithet(s) within the same family ——..— 10 

7b ( 6 a). Modified {subjspectftc epithet does not match -*——-..-— 9 

8 a ( 6 b). Submitted (sub)specific epithet with fourth and following characters 

replaced by a general wildcard and with the matching name of a length 
equal to the length of the submitted name +/- one character 
matches one or more valid, synonymous, or misspelled 

(sub)specific epithet(s) within the same family -.— 10 

8 b ( 6 b). Modified (subspecific epithet does not match . 9 
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9 Action: Print submitted name and comment: "No matching or 
similarly spelled {sub) specific epithet could be found within this 
family," Proceed with next name. If end of list, end procedure, 

10 Action: Print submitted name and all matching or similarly 
spelled names, the status of the names, the corresponding 
valid name, the reference, and the family. 


Proceed with next name. If end of list, end procedure. 


